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ABSTRACT 


This  report  describes  the  results  obtained  during  the  second  year  of  a 
program  of  continuing  research  in  the  mathematical  problems  associated  with 
the  analysis  and  design  of  Air  Force  sensor  correlation  and  data  fusion  sys¬ 
tems.  These  systems  play  a  vital  role  in  the  command  and  control  process,  but 
there  presently  exists  no  systematic  and  quantitative  methodology  for  their 
analysis  and  design.  In  the  first  year  of  research,  ALPHATECH  investigated  an 
important  subproblem:  the  .distributed  detection  problem  associated  with  de¬ 


termining  the  presence  or  absence  of  targets  from  a  collection  of  distributed 
sensors.  In  the  second  year  of  research,  ALPHATECH  has  obtained  novel,  exact 
expressions  for  the  probability  density  functions  of  the  local  log  likelihood 
ratios,  and  has  used  these  expressions  to  generate  an  extensive  set  of  design 
curves.  In  the  third  year  of  research,  which  is  presently  on-going,  we  sr« 
investigating  another  important  class  of  problems,  namely,  sequential  distrib¬ 
uted  detection  problems. 
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SECTION  1 
INTRODUCTION 

The  purpose  of  a  military  command,  control,  and  communications  (C^)  sys¬ 
tem  is  to  provide  timely  and  accurate  information  to  commanders.  An  important 
component  of  this  information  is  the  order  of  battle,  i.e.,  the  location  and 
status  of  forces.  Military  sensor  correlation  and  data  fusion  systems  exist 
primarily  to  determine  the  order  of  battle. 

There  is  at  present  no  adequate,  quantitative  theoretical  framework  for 
analysis  and  design  of  sensor  correlation  and  data  fusion  systems.  This  is 
because  these  systems  are  large  scale:  many  thousands  of  individual  targets 
must  be  kept  track  of  and  aggregated  into  organizational  units;  they  are 
stochastic:  observations  are  uncertain  both  due  to  noise  and  random  errors  as 
well  as  deliberate  enemy  deception;  and  they  are  distributed:  a  myriad  of 
geographically  separated  sensor  types  (including  humans)  is  present  that  are 
Interconnected  by  communication  links  with  capacity  limited  for  both  technical 
and  security  reasons. 

Thus,  extensions  to  the  classical  theories  of  centralized  signal  proces¬ 
sing  and  detection  are  needed  to  guide  the  design  of  sensor  correlation  and 
data  fusion  systems,  and  that  is  the  purpose  of  the  research  reported  herein. 

In  Section  2,  we  briefly  survey  the  previous  research  on  distributed 
detection.  In  Section  3,  we  recapitulate  the  new  results  obtained  under  the 
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second  year  of  AFOSR  sponsorship.  These  results  are  discussed  in  much  greater 
detail  in  Appendix  A,  "Distributed  Detection  of  Signal  Waveforms  in  Additive 


Gaussian  Measurement  Noise. 
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SECTION  2 

REVIEW  OF  PREVIOUS  RESEARCH 


The  problem  of  constructing  decentralized  hypothesis  testing  rules  has 
recently  been  introduced  and  studied  in  the  framework  of  team  decision  theory 
[2],  [3].  In  this  framework,  Tenney  and  Sandell  [1]  considered  the  first  sim¬ 
ple  distributed  detection  problem  where  there  are  two  hypotheses,  denoted  0  or 
1,  and  two  detectors.  In  their  formulation,  Tenney  and  Sandell  assumed  that 
the  detectors  have  a  common  objective,  (i.e.,  their  detection  problems  are 
coupled  through  the  costs)  and  each  detector  takes  one  measurement  (or  a  set 
of  measurements)  and  makes  a  decision  based  only  on  his  oim  information.  The 
measurements  of  the  detectors  are  assumed  independent  conditioned  on  the  hy¬ 
pothesis.  Under  these  assumptions,  it  was  shown  [1]  that  the  team  optimal 
strategies  of  the  two  detectors  are  described  by  thresholds  which  are  deter¬ 
mined  by  the  solution  of  two  coupled  nonlinear  algebraic  equations. 

Lauer  and  Sandell  {4]  -  [7]  extended  the  results  of  [1]  to  the  case  of 
correlated  waveform  observations.  They  found  that  in  general  the  determina¬ 
tion  of  the  optimal  decision  rules  of  the  two  detectors  requires  the  solution 
of  coupled  nonlinear  functional  equations.  Then,  Lauer  and  Sandell  examined 
several  special  cases  and  suboptlmal  approaches.  For  the  special  case  of  de¬ 
tecting  linearly  dependent  signals  in  white  noise,  they  determined  that  the 
local  likelihood  ratio  is  a  sufficient  statistic  for  detections  and  they  com¬ 
puted  numerical  examples  for  the  case  where  the  signal  is  a  random  process. 


3 


ALPHATECH,  INC 


They  examined  a  suboptimal  solution  consisting  of  local  likelihood  ratio  tests 
with  jointly  optimized  thresholds,  and  obtained  results  for  a  number  of  inter¬ 
esting  cases. 

Ekchian  [8]  considered  a  problem  similar  to  that  of  Tenney  and  Sandell 
[1]  but  assumed  in  addition  that  a  unidirectional  communication  link  exists 
between  the  two  detectors.  Ekchian  found  that  the  team  optimal  decision  rules 
of  the  two  detectors  are  described  by  thresholds,  the  computations  of  which 
are  coupled.  In  addition,  he  found  that  the  detector  receiving  the  communica¬ 
tion  uses  one  of  two  thresholds  depending  on  the  decision  of  the  other  detec¬ 
tor. 

The  work  on  distributed  detection  reported  in  [1]  and  [4]  -  [8]  assumes  a 
model  with  static  hypotheses  (i.e.,  the  true  hypothesis  does  not  change  with 
time)  and  static  observations  (i.e.,  the  detectors  take  one  measurement  or  a 
set  of  measurements  and  make  a  decision). 

Teneketzis  [9]  considered  a  distributed  detection  problem  with  static  hy¬ 
potheses  and  dynamic  observations  (i.e.,  at  each  Instant  of  time,  each  detec¬ 
tor  can  either  stop  and  make  a  decision  or  request  more  information  at  some 
cost).  Teneketzis  [9]  formulated  a  finite  horizon  decentralized  optimal  stop¬ 
ping  problem  with  two  hypotheses,  two  detectors,  and  time  which  is  the  decen¬ 
tralized  version  of  Wald's  problem.  He  found  that  the  optimal  decision  rules 
of  the  two  detectors  are  described  by  thresholds.  The  thresholds  of  the  two 
detectors  are  time  varying  and  coupled  and  are  determined  by  the  solution  of 
4N-2  nonlinear  algebraic  equation  in  4N-2  unknowns,  where  N  is  the  number  of 
observations. 

Subsequently,  Teneketzis  and  Varalya  [10]  solved  a  distributed  detection 
problem  with  dynamic  events  (i.e.,  the  case  in  which  the  true  hypothesis 
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changes  with  time)  and  dynamic  observations.  They  formulated  an  infinite  ho¬ 
rizon  decentralized  optimal  stopping  problem,  with  two  hypotheses  and  two  de¬ 
tectors,  which  is  the  decentralized  version  of  a  quickest  detection  problem. 
They  found  that  the  optimal  decision  rules  of  the  two  detectors  are  described 
by  thresholds  which  can  be  determined  by  the  solution  of  two  coupled  dynamic 
programming  equations. 

Recently,  Kushner  and  Pacut  [11]  studied  a  decentralized  detection  and 
coordination  problem  via  simulation.  They  consider  two  hypotheses,  0  or  1, 
and  two  detectors.  Each  detector  takes  an  observation  at  time  1  and  may,  if 
it  wishes,  take  an  observation  at  time  2.  The  second  observation  costs  C.  The 
detectors  do  not  communicate  with  each  other.  At  the  end  of  its  "obser¬ 
vation  period"  each  detector  transmits  its  conditional  probabilities  of  the 
hypotheses  to  a  coordinator  who  then  computes  the  posterior  probability  and 
decides  on  either  0  or  1.  Kushner  and  Pacut  [11]  investigate  the  effects  of 
prior  probability  and  parametric  dependencies  on  the  decision  rules,  as  well 
as  sensitivity  to  the  data,  asjmmetries  in  the  design  rules  and  other 
phenomena. 


ALPHATECH,  INC 


I 


a 


I 

I 


m 


SECTION  3 

REVIEW  OF  NEW  RESULTS 


The  key  issue  addressed  in  the  second  year  of  our  research  emerged  in  the 
process  of  constructing  Receiver  Operating  Characteristics  (ROCs)  for  the 
distributed  detection  laws  derived  during  the  first  year  [4]  -  [7].  In  con¬ 
structing  these  ROCs,  a  Gaussian  approximation  to  the  local  likelihood  ratios 
is  inadequate  since  the  interesting  portions  of  the  ROCs  are  generated  by 
integrating  over  the  tails  of  the  probability  distributions  of  the  local  like¬ 
lihood  ratios.  In  the  course  of  an  effort  to  derive  bounds  on  the  detection 
and  false  alarm  probabilities,  several  remarkable  exact  formulas  for  these 
probabilities  were  obtained.  These  formula  are  novel  even  in  the  classical 
centralized  case. 

The  basic  difficulty  can  be  seen  from  the  equation 

T  T 

~  ~  I  I  yi(t)hi(t,u)yi(u)dudt  (3-]) 

0  0 

for  the  1-th  local  likelihood  ratio  which  expresses  this  quantity  as  a 
quadratic  form  in  the  Gaussian  observation  process  yi(t).  (The  kernel  hjCt.u) 
is  the  solution  of  a  certain  integral  equation.)  Thus  the  are  emphatically 
non-Gaussian. 

Surprisingly,  the  following  exact  expressions  can  be  derived  for  the 
under  the  hypotheses  (no  signal  present)  and  (signal  present): 
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where  p,  Yl,  Y2»  3^^  ^  sre  certain  parameters  defined  in  terms  of  the  statis¬ 
tics  of  the  observation  process  yi(t),  r(*)  is  the  Gamma  function,  and  I^C*) 
is  the  modified  Bessel  function  of  order  v. 

The  probability  density  functions  p(£i,£2|H*^)  and  p(£l,£2|H^)  can  then  be 
analytically  integrated  to  determine  the  ROCs.  Full  details  are  contained  in 
[12],  which  is  included  in  this  report  as  Appendix  A.  Thus,  for  the  specific 
observation  process  defined  precisely  in  [12],  we  have  been  able  to  derive 
exact  formulas  for  the  detection  and  false  alarm  probabilities,  and  have  used 
these  formulas  to  construct  design  curves  for  distributed  surveillance 
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ABSTRACT 


This  paper  is  concerned  with  the  detection  of  signal  waveforms  by  a  dis¬ 
tributed  surveillance  network  comprised  of:  (1)  a  collection  of  spatially 
separated  sensors,  and  (2)  local  signal  processors  collocated  with  the  sen¬ 
sors.  The  local  signal  processors  are  assumed  to  implement  likelihood  ratio 
tests  to  detect  the  presence  or  absence  of  the  signals.  Signal  detections  may 
be  used  for  local  decisionmaking  or  passed  upward  to  a  fusion  center  for  fur¬ 
ther  processing.  In  either  case,  the  local  detection  thresholds  cannot  be 
determined  independently,  but  must  be  determined  jointly  to  optimize  overall 
surveillance  system  performance.  Results  are  presented  concerning  the  nature 
of  this  threshold  computation  for  a  number  of  interesting  cases. 
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SECTION  1 
INTRODUCTION 

Classical  detection  theory  (see,  e.g.  ,  tl]~l4]*)  has  been  motivated  pri¬ 
marily  by  single  sensor  detection  problems.  Although  the  signal  processing 
solutions  of  classical  detection  theory  are  in  principle  equally  applicable  to 
multiple  sensor  detection  problems,  in  practice  these  solutions  may  require 
the  communication  of  raw  received  signals  from  physically  remote  sensors  to  a 
central  processing  location.  In  many  surveillance  systems,  particularly  mili¬ 
tary  systems,  such  communication  capability  is  unavailable  for  reasons  of 
cost,  reliability,  bandwidth,  survivability,  security,  and  similar  factors. 

In  practice,  the  sensors  and  associated  local  signal  processors  in  dis¬ 
tributed  surveillance  networks  tend  to  be  designed  relatively  independently  of 
one  another,  with  communication  restricted  to  higher  level  signal  characteris¬ 
tics  (e.g.,  reports  of  emitter  detections).  While  such  practice  has  the  vir¬ 
tue  of  simplicity,  there  is  a  potential  loss  in  performance  due  to  considering 
each  sensor  Individually,  rather  than  as  an  element  in  an  overall  surveillance 
system. 

Motivated  by  such  considerations,  Tenney  and  Sandell  formulated  and 
solved  a  number  of  distributed  detection  problems  (with  random  variable  obser¬ 
vations)  in  [5].  From  a  technical  point-of-view,  these  problems  were  problems 

*References  are  indicated  by  numbers  in  square  brackets  and  appear  at  the  end 
of  this  paper. 
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In  team  theory  [6]  or  decentralized  control  theory  [7],  Subsequently,  a  num¬ 
ber  of  authors  have  generalized  the  formulations  in  [5]  in  various  directions 
[8]-[12];  however,  all  have  considered  only  the  case  of  random  variable 
observations. 

In  this  paper,  we  generalize  the  distributed  detection  theory  results  of 
[5]  to  the  case  of  waveform  (i.e.,  stochastic  process)  observations.  Thus  we 
will  have  two  hypotheses  to  test, 

1.  signal  is  absent,  and 

2.  :  signal  is  present. 

Corresponding  to  these  two  hypotheses,  we  have  the  potential  decisions  ui=0 
(H®  is  true)  and  u^®!  (H^  is  true)  made  at  the  i-th  local  sensor.  The  cost  of 
decision  errors  is  measured  by  a  cost  function  JCu^.u^.H)  (for  the  case  of  two 
sensors);  it  is  desired  to  choose  local  decisions  u^  on  the  basis  of  locally 
available  observations  y^Ct)  to  minimize  the  expected  value  of  J. 

Note  that  in  the  special  case  of  a  surveillance  system  in  which  the  deci¬ 
sions  Ui  are  transmitted  to  a  central  fusion  center  (Fig.  1-1)  where  a  final 
decision  is  made,  the  cost  J  has  the  special  form 

J(uj,U2,H)  =  J’(f(u^,U2),H)  (1-1) 

where  J'(u,H)  is  the  global  decision  cost  and  f(u^,U2)  is  the  (fixed)  fusion 
law  (e.g.  ,  voting)  used  to  determine  u  from  u^  and  u^. 

In  the  sequel,  we  will  assume  that  the  local  signal  processing  imple¬ 
mented  at  the  i-th  sensor  consists  of  the  following  two  steps:* 


*As  will  be  noted  subsequently,  only  in  special  cases  is  it  in  fact  optimal  to 
base  the  detection  decision  on  the  local  likelihood  ratio. 
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1.  Determine  the  local  log-likelihood  ratio  and 

2.  Implement  the  test* 

> 

Ai  <  Ti  .  (1-2) 


We  will  then  determine  the  optimal  thresholds 

In  Section  2,  we  consider  the  case  in  which  each  yi(t)  is  either  a  known 
signal  plus  noise  (H^ ) ,  or  just  noise  (H®).  In  the  special  cases  in  which: 

(1)  the  noise  is  uncorrelated  between  sensors,  and  (2)  the  noise  is  correlated 
between  sensors  but  is  white  and  the  signals  are  linearly  dependent,  we  are 
able  to  obtain  sufficient  statistics  for  local  decisionmaking.  The  technique 
Involves  expanding  the  observation  processes  into  a  Karhuren-Loeve  series. 
Numerical  results  are  provided  for  the  case  of  detection  of  a  sinusoidal  sig¬ 
nal  in  white  Gaussian  noise.  Numerical  results  are  presented  for  both  the 
general  Bayesian  case  and  for  the  case  with  a  centralized  fusion  center. 

In  Section  3  we  consider  the  case  in  which  the  signal  is  a  Guasslan  sto¬ 
chastic  process.  Results  are  more  difficult  to  obtain  here,  since  the  log- 
likelihood  ratio  is  a  nonlinear  (quadratic)  function  of  the  observations  even 
in  the  centralized  case.  However,  in  the  Important  special  case  of  an  ideal 
band  limited  signal  in  white  noise,  we  are  able  to  obtain  a  closed  form 
(albeit  rather  complex)  expression  for  the  joint  probability  density  function 
of  the  local  log-likelihood  ratios.  This  expression  permits  us  to  solve  for 

*The  notation  means  "choose  if  A^^Ti,  and  choose  H®  if  £i<Ti." 
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the  optimal  distributed  detection  thresholds.  Numerical 
for  both  the  general  Bayesian  case  and  for  the  case  with 
center. 

Section  4  contains  a  summary,  some  conclusions,  and 
future  research. 


results  are  presented 
a  central  fusion 

suggestions  for 
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SECTION  2 

KNOWN  SIGNAL  IN  NOISE 

In  this  section  we  consider  a  distributed  detection  problem  in  which  the 
signals  are  known  deterministic  waveforms  and  the  noise  processes  are  colored 
Gaussian  processes.  As  stated  earlier,  we  will  restrict  attention  to  the  case 
of  two  detectors  each  of  which  must  make  a  binary  decision  based  on  local 
observations . 

2.1  PROBLEM  FORMULATION  AND  PRELIMINARY  ANALYSIS 

We  assume  that  there  are  two  sensors,  indexed  by  i=l  ,2  and  that  there  are 
two  hypotheses  to  be  tested  based  on  the  sensor  observations.  The  observa¬ 
tions  under  the  two  hypotheses  are  modeled  by  the  equations 

yi(t)  =  -^Ei  si(t)  +  ni(t)  To^tcTf 

(2-1) 

H®:  yi(t)  «  ni(t)  To<t<Tf  . 

We  assume  that  the  s^Ct)  are  known  signals  with  unit  energy  and  are  zero  out¬ 
side  the  interval  10,T]  where  To<0<T<Tf.  The  ni(t)  are  assumed  to  be  zero- 
mean  Gaussian  processes  where 

E{ni(t)nj(T)}  =  Kij(t,T)  ,  i,j=l,2  (2-2) 

and  we  assume  (to  avoid  the  possibility  of  singular  detection) 
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Kii(t,T)  =  K^Ct.T)  +  Ni6(t-T)  (2-3) 

with  Ni^O.  Note  that  we  do  not  assume  that  the  noise  processes  are  uncorre- 

n 

lated  between  sensors,  i.e.,  we  allow  Kij(t,T)  to  be  nonzero  for  i^j . 

The  approach  we  will  use  to  determine  the  optimal  distributed  detection 
laws  consists  of  expanding  the  received  waveforms  in  a  Karhunen-Loeve  (K-L) 
expansion  [1]  and  considering  the  problem  formed  by  truncating  the  infinite 
series  of  K-L  coefficients  to  the  first  K  terms.  The  truncated  problem  can  be 
approached  via  the  technique  of  [5]  and  results  for  the  waveform  problem 
obtained  by  taking  the  limit  as  K-*-®. 

Thus  we  expand  y^Ct)  via* 


K  k  k 

yi(t)  =  lira  Z  yi(t>i(t)  ,  To<t<Tf 
K-*-®  k=l 


k 

where  the  4ii(t)  satisfy 


k  k  c  k 

^i<^i(t)  =  /  Ki ( t  ,u ) ^i(u)du  ,  To<t<Tf  . 

To 


Under  we  have 


(2-4) 


(2-5) 


k  k  k 

yi  =  Si  +  ni  (2-6) 


where 


*x=lim  xj^  is  defined  to  mean  lim  Bx-x^  11^=0. 
k-^»  k-K® 
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k  Tf 

Si  A  /  JeI  Si(t)(|)i(t)dt 

»T»  » 


(2-7) 


It  It 

"i  A  /  ni(t)(|>i(t)dt  , 
Tn 


(2-8) 


while  under  H®  we  have  that 


k  k 
Vi  =  ni 


(2-9) 


It  is  straightforward  to  show  that 


E{ni}  =  0 


(2-10) 


k  k  2 

E{(ni)2}  =  Xi  +  Ni 


(2-11) 


and  that  the  K-L  coefficients  corresponding  to  each  sensor  are  uncorrelated; 


2,  k 

E{nini}  =  0  £^k  . 


(2-12) 


However,  the  K-L  coefficients  for  the  two  sensors  are  correlated,  that  is, 

Tf  Tf 

E{(yJ-E{yJlHj})(yk-E{yk|Hj}|Hj}  =  /  /  .|,5(t)Kn2(t  ,u)  (fk(u)dtdu  A  c^k  , 


j=0,l  ,  Jl^k 


(2-13) 


which  is  not  zero  in  general. 
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Using  the  results  of  [5]  we  can  determine  (implicitly)  the  optimal  dis¬ 
tributed  decision  law  for  the  problem  based  on  the  first  K  coefficients  of  the 
K-L  expansion.  Define 


and 


K  1  2  K 

yj.  =  lYi.yi.***  .Vil 


K  K 

AiC^i)  =  n  exp 
k=l 


IV  O  EV  IV 

(si)  -2yiSi 
2  ( ) 


(2-lA) 


(2-15) 


The  optimal  decision  law  is  then  given  by  the  solution  of  the  following  two 
equations : 

0 

A  (yK)  >  -  (2-16a) 

<  aJj+aj^  /.../  p(y^|y^,H0)d2^ 

y2 1^2=0 


AJg+AJ^  /•••/  P(y^|y|.Hl)dyK 

K| 

0  y  lu  =0 

^  1  1 

A  (yK)  >  - -  ,  (2-16b) 

<  AJg+AJ^^  /.../  p(yK|yK,H0)d2K 

2K|u^=0 

where  the  AJ^  are  given  by  the  following  equations: 
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AJ^  =  J(0,1,h1)  -  J(1,1,h1)  (2-17a) 

AJ^  =  J(0,0,h1)  +  J(1,1,h1)  -  J(I,0,h1)  -  J(0,I,h1)  (2-17b) 

AJ3  =  Jd.l.HO)  -  J(0,1,hO)  (2-17c) 

AJ^^  =  J(1,0,hO)  +  J(0,1,hO)  -  J(0,0,hO)  -  J(1,1,hO)  (2-17d) 

AJg  =  Jd.O.H^)  -  J(1,1,h1)  (2-17e) 


and 

AJg  =  JCI.I.hO)  _  j(i,o,hO)  .  (2-17f) 


Note  that  the  decision  law  for  each  sensor  is  required  to  determine  the 
region  of  integration  for  the  right-hand  side  of  Eq.  2-16.  Thus,  the  deter¬ 
mination  of  the  optimal  distrihuced  decision  law  requires  the  solution  of 
coupled  nonlinear  functional  equations.  No  general  analytic  solution  to  these 
equations  has  been  determined  and  numerical  techniques  do  not  seem  to  be  com¬ 
putationally  feasible.  In  general,  it  is  necessary  to  assume  some  special 
form  for  the  local  signal  processing  (e.g.,  a  likelihood  ratio  computation) 
and  optimize  the  thresholds.  However,  in  certain  special  cases  more  can  be 
said . 
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2.2  SPECIAL  CASES 
2.2.1  Uncorrelated  Noise 

The  easiest  case  to  consider  is  that  in  which  the  noise  is  uncorrelated 
between  sensors,  i.e.,  Under  this  assumption  we  have  that 


K  K  K 

P^ZilVj.H^)  =p(^|H^)  i,j  =  l  ,2,i^tj  ,i=0,l 


and  thus  Eq.  2-16  becomes 


AJj+AJ^  /•••/  p(y^|Hl)dyK 


y  u  =0 
"2  2 


A  (yK)  >  -  A  TK 

<  AJg+AJ^  /•••/  p(X^|H°)dyK 


yj 1^2=0 


(2-18) 


(2-19) 


AJg+AJ2  /.../  p(yK|Hl)dx^ 


u  11 

A,(yK)  >  -  A  tK  .  (2-20) 

<  AJj+AJ^  /.../  p(y^|H0)dxK 

yK|u^=0 


Note  that  the  right-hand  sides  of  the  above  equations  do  not  depend  upon 

K 

the  local  observations,  but  are  constants,  which  we  define  as  .  Thus,  the 
optimal  distributed  decision  law  is  given  by  a  pair  of  local  likelihood  ratio 
tests  where  the  thresholds  are  implicitly  defined  by  Eqs.  2-19  and  2-20. 

If  we  take  the  limit  as  K-*-«  then  one  can  show  (see  [1])  that  the  optimal 
decision  laws  can  be  written  as 


11 


ALPHATECH,  INC 


Tf 

/  yi(t)gi(t)dt 

To 


^  1  ^ 
0  “ 


Si(t)gi(t)dt 


Ti 


(2-21) 


where  the  left-hand  side  of  Eq.  2-21  is  determined  by 

Tf 

gi(t)  =  /  Qi(t ,u)si(u)du 

To 


and  Qi(t,u)  is  defined  by 


/  Kii(t ,u)Qi(u,v)du  =  6(t-v) 
To 


(2-22) 


(2-23) 


The  right-hand  side  of  Eq.  2-21  is  a  constant  (observation-independent) 
threshold  and  the  satisfy  the  following  nonlinear  coupled  algebraic 
equations ; 


(  AJ  +AJ  [  1-erf  (  (  t  +m  ) /  V 2ni  )  J  ) 

\  1  2  2  2  2  /  pi 

X  =  £n  .  -  .  — 

I  1-erf ((T2-m2)/y2m2)]  I  p° 


(  AJ  +AJ  [  1-erf ((x  -hn  )]  ) 

\  5  2  1  I  ^  1  (  pi 

T  =  in  V  -  ,  — 

I  AJg+AJ^[l-erf((T^-mj)/y2mj)]  1  pO 


where 


®i  A 


Tf 

/ 


Si(t)gi(t)dt  . 


(2-24) 


(2-25) 


(2-26) 
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Note  that  in  general,  Eqs.  2-24  and  2-25  do  not  have  a  unique  solution 
and  thus  the  optimal  decision  law  must  be  determined  by  computing  all  solu¬ 
tions  and  then  comparing  their  performance.  The  decision  law  of  Eq.  2-21  ex¬ 
tends  the  results  of  [5]  to  the  case  of  waveform  observations  and  this  law  can 
be  interpreted  as  a  pair  of  local  likelihood  ratio  tests  with  thresholds  which 
are  jointly  optimized  according  to  a  system-wide  measure  of  performance. 


2.2.2  Linearly  Dependent  Signals  and  Correlated  White  Noise 

In  this  special  case,  we  assume  that  s^=S2^s  so  that  the  observations  are 
given  by 


yi(t)  =  y/E'  s(t)  +  ni(t)  0<t<T 
H°:  yi(t)  =  ni(t)  0<t<T  . 


(2-27) 


Furthermore  we  assume  that  the  ni(t)  are  zero-mean  unit  spectral  height,  white 
Gaussian  noise  processes  where 


E{n^(t)n2(T)}  =  p6(t-T)  . 


(2-28) 


As  usual  we  expand  the  received  waveform  in  a  K-L  expansion  where  now  we 
k 

choose  the  i^i(t)  to  be  any  complete  orthonormal  set  such  that 

4,k  .  ^k(t)  =  ^k(t)  o<t<T  (2-29) 


and 


(I)l(t)  =  s(t)  0<t<T  . 

It  is  easy  to  verify  that  if 


(2-30) 
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^  k  k 

yi(t)  =  lim  I  yi4.  (t) 
K+oo  k=  1 


(2-31) 


then 


1  _  (  +  wi  :  Hi 

i  I  Wi  :  hO 


(2-32) 


k  Wi  :  Hi 

yi  =  k=2,...  (2-33) 

I  Wi  :  rO 


where 


T 

Wi  A  /  4|i(t)ni(t)dt 
”  0 


and 


E{wkwi}  =  p6(k-Jl)  .  (2-34) 

k 

Note  that  the  coefficients  yj  for  k>2  have  the  sane  statistics  under  both 

hypotheses  and  are  independent  of  the  y^.  Thus  any  optimal  decision  law  will 

use  only  the  y^  to  decide  between  the  hypotheses  H®  and  rI ,  The  y^  are  thus 
i  i 

scalar  sufficient  statistics  for  optimal  decisionmaking  in  the  team  decision 
problem  with  information  pattern  defined  by  Eqs.  2-27  and  2-28. 

Any  optimal  decision  law  can  therefore  be  specified  for  this  problem  by 
defining  the  regions  of  the  real  line  in  which  y|  must  lie  for  a  decision  of 
Rl  to  be  made.  These  regions  can  be  specified  by  their  endpoints,  and  thus 
the  decision  law  can  be  characterized  by  a  set  of  endpoints,  or  thresholds. 
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for  each  detector.  The  optimal  distributed  decision  law  is  determined  by 

21-1  21  l^i 

choosing  collections  of  thresholds  {(T^  ]}ji=i,  1=1.2  as  follows: 

2i  T  2£+l 

/  1  if  <  J  Si(t)yi(t)dt  <  Ti  ,  £=1,...,L 

ui  =  .  (2-35) 

\  0  otherwise 


Necessary  conditions  for  the  optimality  of  the  thresholds  are  readily 
developed  if  is  specified.  Since  the  number  of  such  thresholds  is  arbi¬ 
trary  one  must  compare  the  performance  of  laws  with  different  L^'s  to  deter¬ 
mine  the  optimal  distributed  detection  law.  We  have  calculated  optimal  laws 
based  on  the  assumption  that  L^=L2=L=1  or  3.*  In  all  cases  the  laws  based  on 
the  assumption  that  L=1  proved  superior.  While  we  have  been  unable  to  prove 
that  L=1  is  optimal,  we  will  only  consider  that  case  in  the  sequel. t 
The  optimal  detection  law  for  the  L=1  case  is  given  by 


T  1 

/  s(t)yi(t)dt  >  Ti  (2-36) 

0  < 

0 


wherre  T^  and  T^  satisfy 


*Clearly  Ui=0  as  y^+-®  and  Ui  =  l  as  yl++<«>  so  that  an  even  number  of  intervals 
i  i 

is  required  and  thus  L  must  be  odd. 

tNote  that  the  necessary  conditions  for  optimality  consist  of  2L  nonlinear 
coupled  equations;  the  assumption  of  L=1  leads  to  considerable  reduction  in 
the  computational  burden. 
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= 


/e~  (  AJ 

_L  1  '  ' 

^  "I 


+AJ  erf 
2 


“) 


(2-37) 


AJg+AJ^  erfl(T2-pT^)/yi-p2] 


/AJ+AJ  erf((T  -^/iT-pCT  -./F') )/ V^l-p^ 
1  l52  l\l  2  \  2 

-  In  - 

AJ^+AJ^  erfKT^-pT^)/-^!^] 


(2-38) 


For  the  special  cases  considered  above  the  "local"  likelihood  ratio  is  a 
sufficient  statistic  for  optimal  distributed  decisionmaking.  More  generally, 
coupled  functional  equations  must  be  solved  to  determine  the  optimal  distrib¬ 
uted  detection  law.  However,  given  the  difficulty  of  solving  such  equations 
and  the  above  results,  a  reasonable  choice  for  a  distributed  detection  law  is 
to  use  local  likelihood  ratio  tests  with  globally  optimized  thresholds.  Note 
that  this  choice  implies  that  the  same  signal  processing  can  be  used  whether  a 
sensor  is  used  alone  or  as  part  of  a  surveillance  network  -  a  strong  practical 
requirement!  In  Section  3  we  will  take  this  approach  when  considering  the 
detection  of  an  unknown  signal  in  noise. 

2.3  NUMERICAL  EXAI-IPLES 

Here  we  consider  an  example  involving  a  scalar  observation  with  corre¬ 
lated  noise.*  First  we  introduce  the  observation  model  and  rewrite  the  neces¬ 
sary  conditions,  then  in  subsections  2.3,1  and  2.3.2  we  consider  two  different 
global  cost  functions. 


*The  special  case  of  uncorrelated  noise  reduces  to  the  scalar  observation 
problem  studied  in  [5]  and  thus  we  do  not  consider  it  here. 
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where  the  are  solutions  to  Eqs.  2-37  and  2-38. 

2.3.1  Bayesian  Formulation 

We  assume  that  the  thresholds  are  to  be  selected  so  as  to  minimize  the 
expected  Bayes  cost  where 

(0  if  u  =u„=H 

j  =  .  1  if  .  ’■  --42) 

\  k  if  u^=U2^H 


This  type  of  cost  criterion  arises  in  situations  where  it  is  not  precisely 
twice  as  costly  to  make  two  errors  as  it  is  to  make  one.  For  example,  if 
weapons  are  collocated  with  sensors  and  targets  are  automatically  attacked 
when  detected,  then  having  two  missed  detections  may  be  much  more  serious  than 
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having  one  missed  detection.  This  situation  would  be  modeled  by  choosing  k.>2, 
so  that  a  double  error  is  more  than  twice  as  costly  as  two  single  errors. 

For  J  defined  by  Eq.  2-42,  Eqs.  2-37  and  2-38  become 


(2-43) 


(2-44) 


and  we  note  that  the  thresholds  for  the  locally  optimal  test*  (T 
T^-VEI/2)  satisfy  Eqs.  2-43  and  2-44,  This  can  be  seen  by  noting  that 


*That  is  the  test  with  thresholds  that  would  be  optimal  for  each  sensor  con¬ 
sidered  in  isolation  when  a  minimum  probability  of  error  criterion  is  used. 
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Figure  2-1  is  a  plot  of  the  Bayes  cost  as  a  function  of  k  for  p=0  and 
p=0.5.  Note  that,  for  larger  p,  the  optimal  DLRT  solution  can  be  much  better 
than  the  LOT.  This  is  because  the  optimal  solution  skews  the  thresholds  to 
avoid  the  double  errors  which  are  more  common  for  larger  p  and  more  costly  for 
larger  k. 

Note  that  for  the  LOT  the  cost  increases  as  an  affine  function  of  k  since 
the  probability  of  a  double  error  is  independent  of  k.  The  expected  cost  for 
the  DLRT  never  exceeds  1  since  that  value  can  be  obtained  by  a  suboptimal  law 
with  thresholds  set  so  that  one  detector  always  decides  0  while  the  other 
always  decides  1. 

Figure  2-2  is  a  plot  of  th,e  Bayes  cost  as  a  function  of  p  for  k=5.  Again 
we  see  that  as  p  increases  the  DLRT  becomes  much  better  than  the  LOT.  This 
occurs  because,  as  p-^1 ,  the  probability  of  a  double  error  (for  fixed  thresh¬ 
olds)  increases.  The  DLRT  solution  skews  the  thresholds  to  decrease  the  prob¬ 
ability  of  a  double  error.  This  yields  a  27  percent  decrease  in  cost  over  the 
LOT. 

These  results  indicate  that  in  some  cases  there  is  a  significant  gain  to 
be  had  by  using  the  optimal  decentralized  likelihood  ratio  test  rather  than  a 
naive  approach  which  ignores  the  correlation  between  sensors. 

2.3.2  Surveillance  System  Design 

In  this  subsection  we  assume  that  a  local  decision  u^  based  only  on  the 
information  provided  by  sensor  i  is  sent  to  a  fusion  center  where  a  global 
decision  u  is  made.  We  assume  that  the  fusion  center  decision  rule  is  defined 
as  u*l  if  and  only  if  u^=U2=l.  We  display  the  overall  performance  of  such  a 
surveillance  system  via  a  generalized  network  receiver  operating 
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characteristic  (ROC)  curve,  which  is  a  plot  of  the  surveillance  network's 
probability  of  detection  versus  probability  of  false  alarm.  In  addition  to 
plotting  the  performance  of  the  DLRT  we  also  plot  the  performance  of  the  opti¬ 
mal  centralized  detection  law.  This  allows  us  to  determine  how  the  perfor¬ 
mance  of  the  surveillance  system  is  degraded  by  requiring  that  only  local 
decisions  (processed  information)  rather  than  raw  sensor  data  be  transmitted 
to  the  fusion  center. 

The  ROC  can  be  obtained  by  varying  the  ratio  of  the  cost  of  a  false  alarm 
to  the  cost  of  a  missed  detection.  If  we  let  the  false  alarm  cost  be  unity 
and  the  missed  detection  cost  be  a  then  the  necessary  conditions  for  optimal¬ 
ity  become 


f.  1  1 

j  1-erf  pT^-^-p(Y^E^))/Vl-e2 

'  'fl  1 

1  1-erf 

.  (2-49) 


Figure  2-3  depicts  the  performance  of  the  optimal  centralized  test  and 
that  of  the  DLRT  for  the  case  where  the  sensors  are  identical, 

We  see  that  as  p  increases  the  centralized  and  DLRT  results  become  more  and 
more  similar.  Figure  2-3  also  shows  that  as  p  increases  the  performance 
degrades . 
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Note,  however,  that  the  performance  of  the  DLRT,  while  more  closely 
approaching  that  of  the  centralized  test,  degrades  as  This  can  be  under¬ 

stood  by  noting  that  if  p  is  the  probability  of  a  local  decision  being  wrong 
then  the  probability  of  a  double  error  is  approximately  p2  when  p=0  but  is  p 
when  p=l.  Since  the  fusion  center  always  makes  an  incorrect  decision  when 
both  local  decisions  are  wrong,  the  performance  degrades  as  p+1 . 

The  phenomenon  of  the  DLRT  and  centralized  results  growing  closer  togeth¬ 
er  as  p  increases  is  not  universal.  Figure  2-4  illustrates  the  behavior  of 
these  two  decision  laws  for  the  case  of  asymmetric  sensors,  Vh-  1  and  yfr^.2. 
We  see  that  for  the  DLRT,  the  performance  degrades  as  p-»-l .  The  reason  is 
exactly  tiie  same  as  in  the  case  of  ^£^  =  .^£^=1 .  For  the  centralized  case  how¬ 
ever,  the  performance  becomes  perfect  as  p+l .  This  occurs  because  by  differ¬ 
encing  the  sensor  observations  one  has 

H^:  Ay(t)  =  y^(t)  -  y^Ct)  =  ( Y/7-l)sin(2Trt)  +  n^Ct)  -  n^Ct)  (2-50) 
Ay(t)  =  y^(t)  -  y^(t/  =  n^(t)  -  n^Ct)  .  (2-51) 

As  p+1  ,  Ay+( Y/^~nsin(2iTt)  if  is  true  and  Ay+0  if  is  true,  thus 
perfect  detection  is  possible. 

These  graphs  illustrate  that  the  performance  difference  between  a  DLRT 
and  a  centralized  test  is  strongly  dependent  on  the  problem  being  considered. 
Similarly,  the  benefit  in  reduction  of  communications  requirements  achieved  by 
using  a  DLRT  rather  than  a  centralized  test  is  highly  problem  dependent.  It 
is  thus  the  case  that  detailed  analysis  is  required  to  determine  whether  a 
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SECTION  3 

UNKNOWN  SIGNAL  IN  NOISE 


In  this  section  we  consider  a  distributed  detection  problem  in  which  the 
signal  is  a  random  Gaussian  process  and  the  noises  are  white  Gaussian  proces¬ 
ses.  Again  we  consider  only  the  case  of  two  detectors  and  binary  hypotheses. 

We  assume  that  the  i-th  sensor's  observation  under  the  two  hypotheses  is 
modeled  by: 


:  yi(t)  =  CiSi(t)  +  ni(t)  O^tcT 

hO:  yi(t)  =  ni(t)  0<t<T 


(3-1) 


where  the  n^Ct)  are  independent  zero-mean  unit  spectral  height,  white  Gaussian 
noise  processes  and  the  signals  Si(t)  are  zero-mean  unit  power  Gaussian  pro¬ 
cesses  with  known  covariances: 


s 

E{si(t)sj(T)}  A  Kij(t,T)  0<t,T<T  .  (3-2) 

s 

If  we  expand  the  yi(t)  in  K-L  series  (generated  by  Kii(t,T))  then  the  op¬ 
timal  distributed  detection  test  is  given  by  Eqs.  2-16  and  2-17,  where  the 
Ai(yi)  are  now  quadratic  functions  of  the  observations  yi(t).  Unlike  the 
known  signal  in  noise  case,  all  the  K-L  coefficients  have  statistics  which 
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depend  upon  which  hypothesis  is  true.  Thus  none  of  the  simplifications  possi¬ 
ble  in  subsection  2.2  can  be  applied  -  to  determine  the  optimal  detection  law, 
coupled  nonlinear  functional  equations  must  be  solved. 

Since  we  cannot  solve  the  equations  defining  the  optimal  distributed 
detection  test  we  look  instead  for  a  good  alternative.  The  test  we  consider 
is  that  motivated  by  the  previous  section:  the  distributed  likelihood  ratio 
test  (DLRT).  These  tests  are  defined  by  the  equation 

1 

£i  >  Ti  ,  i=l,2  (3-3) 

0 

where  is  the  (local)  log-likelihood  ratio  and  the  T^  are  optimized  for  the 
best  global  surveillance  system  performance.  This  class  of  tests  is  not  easy 
to  analyze  for  the  problem  of  detecting  an  unknown  signal  in  noise.  The  dif¬ 
ficulty  arises  because  the  local  log-likelihood  ratios  are  not  Gaussian.  In 
the  discussion  below  we  consider  a  problem  for  which  we  are  able  to  obtain 
results . 


3.1  IDEAL  BANDLIMITED  SIGNAL 

Consider  a  problem  in  which  the  observations  are  modeled  by* 
:  yi(t)  =  Cis(t)  +  ni(t)  0<t<T 
H°:  yi(t)  =  ni(t)  0<t«;T 


(3-A) 


*The  bandpass  version  of  this  problem  is  readily  treated  in  an  entirely  analo¬ 
gous  fashion;  such  a  formulation  is  a  reasonable  model  for  passive  detection 
of  a  radio  transmitter  operating  at  a  known  frequency  and  with  a  known 
bandwidth. 
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where  s(t)  is  a  zero-mean  Gaussian  stochastic  process  of  unit  power  with  an 
ideal  bandlimited  spectrum  (Fig.  3-1)  and  the  n^Ct)  are  zero-mean  unit  spec¬ 
tral  height,  Gaussian  white  noise  processes.  We  denote  the  covariance  of  the 
signal  by  KS(t  , 


S(f) 


R-05i: 


Figure  3-1.  Signal  Spectrum. 

To  determine  the  performance  of  the  DLRT  we  first  determine  the  form  of 
the  local  likelihood  ratios  and  then  calculate  their  joint  probability  distri¬ 
bution  function  (conditioned  on  each  hypothesis). 

3.1.1  Local  Log-Likelihood  Ratio 

If  we  expand  the  received  signal  via  a  K-L  series, 

K 

yi(t)  =  lim  r  y*^<|)^(t)  0<t<T  (3-5) 

¥.*<*>  k=l  i 

where  the  <J>*^(t)  satisfy 
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T 

Xk^k(t)  =  J  KS(t,u)4>*^(u)du  0<t<T 

0 


(3-6) 


it  is  easy  to  show  [2]  that  the  log-likelihood  ratio  based  on  the  first  K 
coefficients  at  the  i-th  sensor  is  given  by 


K  i 

E  - 

k=l  l+c2Ak 

i 


K 

E  inCl+c^x'^) 
k=l  i 


(3-7) 


Since  the  thresholds  are  constants  in  Eq.  3-3  we  shall  henceforth  work  with 

only  the  data  dependent  portion  of  which  we  define  as 

i 


1  K 

£K  A  -  z 
i  2  k=l 


1  l+c^X^, 

\  i  / 


(yk)2 

i 


In  the  limit  as  we  obtain  [2] 


(3-8) 


1  T  T 

^i  '  ~  /  I  yi(t)hi(t,u)yi(u)dudt  (3-9) 

2  0  0 


where  hi(t,u)  satisfies 


hi(t,u)  +  /  c^hi(t,z)K®(z,u)dz  *  c^KS(t,u)  ,  0<t  ,u<T 

0  i  i 


(3-10) 
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3.1.2  Joint  Probability  Distribution  Function 

The  derivation  of  expressions  for  the  joint  probability  distribution 
functions  of  and  conditioned  on  H®  and  is  complex  and  thus  the 
details  are  relegated  to  Appendix  A.  Here  we  briefly  sketch  the  derivation. 

We  first  compute  the  joint  moment  generating  function  of  the  £^.  This  is 
relatively  straightforward  as  the  y^  are  (for  each  i)  conditionally  indepen¬ 
dent.  We  then  take  the  limit  as  K+«.  If  we  define 


-r£  -s£ 

lii(r,s)  A  E{e 


(3-11) 


then  we  have 


£n  u  (r,s)  =  -  Z  £n 
°  2  k=l 


l+c^x'^ 


l+( 


,,  2,k 
1+c  A 
2 


l+(l+s)c2x*^j 


(3-12) 


£n  p^(r,s)  = 


1  “ 

-  Z  £n 
2  k=l 


(1-p^) 


( l+r(  l-p^jc^xl^)  ( l+s(  l-p2)c2x^)-p2 
k  1  k  2  k 


(3-13) 


where 


c  c  X 
1  2 


Pk  A 


V(l+c2xk)(l+c2xk) 


(3-14) 
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If  the  observation  time  is  long  in  comparison  to  the  signal  time  con¬ 
stants  (i.e.,  if  WT  is  large  enough)  we  have  [1]  that  the  infinite  sums  in 
Eqs.  3-12  and  3-13  can  be  replaced  with  integrals  in  which  the  eigenvalue  mag¬ 
nitude  appearing  in  Eqs.  3-12  and  3-13  is  replaced  by  the  signal’s  spectral 
height.  This  yields  (for  the  ideal  bandlimited  signal)  a  trivial  integral  and 
we  have 


l+c^/2W 

WT 

l+c^/2W 

1 

2 

l+(l+r)c2/2W 

1+(1+s)c2/2W 

Wj(r,s) 


l-p2 


WT 


Cl+r(l-p2)c2/2W)(l+s(l-p2)c2/2W)-p2 


(3-16) 


where 


c  c  /2W 
1  2 

p  =  -  .  (3_17) 

V(1+c2/2W)(1+c2/2W) 


We  can  then  invert  each  moment  generating  function  to  obtain 


P(£j,A2|hO)  = 


1+Y  1+Y 

1  2 


-lA/2  ,  ,4/2-1  . 

1  2 


1  - 

CM  1 

+ 

\  y 

1 

\  ^2  / 

(r(A/2))2 


(3-18) 
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JL 

i 

f”  O  “I  -A/ 2 

1 

2  r- 

(l-p2)Y  Yj  , 

.  ^  (l-p^)Y  Y 

m  P 

i-p^)y^  ( 

l-p2)Y2  >12  1 

*  iA/2-l  2 


— ; — 
(I-p2)YjY2 


(3-19) 


where  we  have  defined  A=2WT  as  the  observation  time-signal  bandwidt  .  product, 

Ei=c2T  as  the  signal  energy  received  by  the  i-th  sensor  and  Yi=Ei/A  3s  the 
i 

signal-to-noise  ratio  in  the  signal  bandwidth  (recall  the  noise  had  unit  spec¬ 
tral  height)  and  where  Iv(*)  is  the  modified  Bessel  function  of  order  v. 

Surprisingly,  both  Eqs.  3-18  and  3-19  can  be  integrated  analytically  when 
A/2-1  is  a  nonnegative  integer.  We  thus  obtain  under  this  assumption  the 
joint  distribution  functions: 


Pr{i^<T^,£2<T2|H0}  = 


-6i  A/2-1 
1-e  r  6r/r! 

r=0  ^ 


-6,  A/2-1 
1-e  ^  Z  6^/r! 

r=0  ^ 


(3-20) 


where 
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Pr{A^<T^,A2<T2|Hl}  = 


»  k+A/2-1! 

A/ 2-1 

r” 

1 

CM 

< 

1 _ 

(l_p2)A/2  1  -  p2k 

1-  Z 

r=0  ^ 

1-  I  B^r! 

k=0  k!A/2-l! 

r=0  2 

(3-22) 


where 


Ti 

6i  =  -  . 

( l-p^)c2 
i 


(3-23) 


Via  Eqs.  3-20  and  3-22  we  can  evaluate  the  performance  of  the  DLRT  for 
any  pair  of  thresholds.  It  is  thus  straightforward  to  determine  via  numerical 
techniques  the  optimal  thresholds  for  any  given  problem.  Results  are  given  in 
the  next  section. 

3.2  r;u;iEKICAL  examples 

Here  we  will  present  numerical  results  for  the  distributed  detection  of 
an  ideal  bandlimited  signal  in  white  Gaussian  noise.  Both  the  general 
Bayesian  case  and  the  case  of  surveillance  system  design  will  be  considered. 

3.2.1  Bayesian  Formulation 

Using  the  cost  function  defined  in  subsection  2.3.1  we  obtain  the  results 
depicted  in  Figs.  3-2  and  3-3.  In  Fig.  3-2  we  have  plotted  the  expected  cost 
of  the  optimal  DLRT  and  that  of  the  locally  optimal  test*  (LOT)  as  functions 


*For  the  cases  considered  here  the  sensors  are  identical  and  we  define  the  LOT 
as  the  DLRT  satisfying  the  additional  constraint  that 
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of  the  double  error  cost  k.  The  different  curves  correspond  to  different 
values  of  signal  energy  (£^=1 ,  10,  20,  40,  100),  wher^  the.  time-bandwidth 
product  has  been  held  constant  (A=10). 

We  note  that  identical  results  are  obtained  for  k<2,  but  that  the  costs 
associated  with  the  LOT  become  affine  for  large  enough  k.  This  occurs  because 
when  T^=T2  the  probability  of  a  double  error  cannot  be  reduced  to  zero.  For 
large  enough  k,  the  cost  of  double  errors  dominates  the  growth  of  the  expected 
value  of  the  cost  EJ  and  the  affine  curves  result.  The  cost  for  the  optimal 
DLRT  of  course  never  takes  on  a  value  greater  than  unity  (since  taking  ^^=1 
and  U2=0  gives  J<1).  As  the  signal  energy  is  increased  the  performance  of  the 
DLRT  improves  and  the  point  at  which  the  LOT  is  not  longer  globally  optimal  is 
also  increased.  This  latter  effect  occurs  because,  as  the  signal  energy  in¬ 
creases,  the  probability  of  a  double  error  decreases  rapidly.  This  means  that 
the  cost  is  dominated  by  the  cost  of  single  errors  until  k  becomes  very  large. 

In  Fig.  3-3  we  again  plot  the  performance  of  the  LOT  and  the  DLRT  versus 
k,  but  now  signal  energy  is  held  constant,  Ei=20,  and  the  time-bandwidth  prod¬ 
uct  is  allowed  to  vary  (A=2,  10,  100).  Here  we  see  the  effect  of  frequency 
diversity  -  for  small  k  the  LOT  is  optimal  and  the  DLRT  performance  for  A=10 
is  better  than  either  A=2  or  A=100  (as  also  in  Figs.  3-4  and  3-5).  As  k  in¬ 
creases,  the  effect  of  double  errors  increases  and  DLRT  performancf  is  domi¬ 
nated  by  the  need  to  minimize  double  errors.  For  large  k  the  DLRT  skews  the 
thresholds  so  that  one  detector  is  likely  to  have  u=0  and  the  other  have  u=l. 
This  strategy  is  so  effective  that,  for  the  cases  of  Fig.  3-3  with  k>20,  the 
cost  of  double  errors  is  less  than  5  percent  of  the  total  cost.  The  structure 
of  the  Bayes  cost  EJ  for  large  k  is  thus  determined  by  the  probability  of 
single  errors  when  the  thresholds  are  skewed  to  avoid  double  errors. 
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Figure  3-5 
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3.2.2  Surveillance  System  Design 

Now  we  present  results  for  the  case  described  in  subsection  2.3.2,  i.e., 
the  case  in  which  a  global  decision  is  made  at  a  fusion  center  by  "adding" 
together  the  local  decisions  made  from  the  observations  of  the  individual 
sensors . 

In  Figs.  3-4  and  3-5  we  plot  the  probability  of  a  miss  (i.e.,  that  is 
decided  when  is  true)  as  a  function  of  the  time-bandwidth  product  A=2WT, 
for  fixed  probability  of  false  alarm  (i.e.,  that  is  decided  when  H®  is 
true).  In  Fig.  3-4  we  have  set  Pp=0.1  and  in  Fig.  3-5  we  have  set  Pp=0.001. 
The  various  curves  represent  the  performance  of  the  DLRTs  and  centralized  LRTs 
for  different  signal  energies. 

The  curves  labeled  Ej  plot  the  DLRT  performance  for  two  sensors  each  of 
which,  under  ,  observes  a  signal  of  energy  Ej.  The  curves  labeled  Eq  plot 
the  performance  of  the  optimal  centralized  likelihood  ratio  test  where,  under 
H^,  the  total  signal  energy  received  is  E^.*  In  both  figures  we  see  the  an¬ 
ticipated  ranking  of  performance  curves:  Ec<Ed<2Ec.  This  occurs  since  the 
DLRT  for  the  two  sensor  case  must  perform  at  least  as  well  as  the  one  sensor 
centralized  LRT  and  cannot  perform  as  well  as  the  two  sensor  centralized  LRT. 

In  both  these  figures  we  see  that  the  performance  initially  improves  as  A 
increases  and  then  degrades.  This  is  basically  due  to  the  effect  of  frequency 
diversity.  We  know  [2]  that  most  of  the  signal  energy  is  associated  with  the 
first  2A+1  eigenfunctions  4i*^(t).  Thus  as  A  increases  the  number  of  signifi¬ 
cant  independent  "observations"  of  the  signal  increases,  however,  the 


*For  this  problem  the  centralized  performance  of  one  sensor  receiving  E^  is 
exactly  equivalent  to  that  of  two  sensors  each  receiving  E(,/2. 
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signal-to-noise  ratio  (SNR)  for  each  "observation”  decreases.  Thus  these 
figures  plot  the  tradeoff  of  number  of  observations  versus  SNR  per 
observation. 

We  note  from  Figs.  3-4  and  3-5  that  the  performance  of  the  optimal  DLRT 
seems  to  be  more  similar  to  that  of  the  one  sensor  centralized  LRT  than  to 
that  of  the  two  sensor  LRT.  In  Fig.  3-6  we  consider  the  relative  performance 
of  these  systems  more  closely.  We  compare  the  centralized  and  decentralized 
systems  by  determining  the  amount  of  signal  energy  required  by  a  centralized 
system  to  perform  as  well  as  a  decentralized  system.  The  ratio  E^/Ed  is 
largest  when  the  decentralized  system  performs  well  and  is  smallest  when  it 
performs  poorly.  We  take  as  baseline  the  performance  of  the  optimal  DLRT  with 
Ed  =100  and  Pf=0.001  and  determine  the  E^  required  to  obtain  the  equivalent  Pm. 
Note  that  for  4=2,  Ec/Ed=1.40  and  thus  the  two-sensor  DLRT  performs  as  well  as 
1.40  centralized  sensors.  As  A  increases  the  DLRT  performance  degrades  until 
at  A=100  we  have  Ec/Ed=1.25. 

The  performance  of  the  DLRT  begins  to  improve  as  2WT  increases  above  100. 
The  asymptotic  performance  has  not  been  determined  as  optimal  performance  has 
not  been  determined  analytically  and,  for  2WT>100,  numerical  difficulties  in¬ 
trude  into  the  computation  of  Pj-j  and  Pp. 

In  Fig.  3-7  we  plot  the  performance  of  DLRTs  for  different  combinations 

i 

of  sensors  with  Pf=0.001.  If  we  let  Ej  denote  the  signal  energy  received  by 

I  2 

the  i-th  sensor  than  for  all  curves  we  have  E  +E  =200.  The  different  curves 

d  d 

1  2 

correspond  to  various  ratios  E  /E  .  We  note  that  the  performance  increases  as 

d  d 

the  sensors  become  more  asymmetric.  This  occurs  since  we  are  effectively 

1  2 

moving  toward  a  centralized  solution  (note  that  at  E  =0,  E  =200  the 

d  d 
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1  2 

performance  is  identical  to  that  of  Ec=200).  When  E  =0  and  E  =200 

d  d 

threshold  is  zero  so  that  u^=l  always  and  the  second  threshold  is 
to  that  associated  with  E(.=200. 


the  first 
identical 
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SECTION  4 

SUIUIARY,  CONCLUSIONS,  AND  SUGGESTED  RESEARCH 

In  this  paper  we  have  considered  the  problem  of  distributed  detection 
with  limited  communication.  We  have  obtained  optimal  detection  laws  for  the 
case  of  known  signals  in  noise  uncorrelated  between  sensors  and  for  the  case 
of  linearly  dependent  known  signals  in  white  noise  correlated  between  sensors. 
For  both  of  these  cases  the  optimal  distributed  detection  law  consists  of 
forming  local  likelihood  ratios  and  testing  these  ratios  against  thresholds  to 
determine  a  decision.  For  uncorrelated  noise  a  single  threshold  is  optimal. 
This  may  well  be  the  case  for  correlated  noise  as  well,  but  we  were  unable  to 
prove  or  disprove  this  conjecture. 

For  a  specific  example,  with  noise  correlated  between  sensors,  we  found 
that  the  distributed  detection  law  performs  worse  as  the  correlation  in¬ 
creases,  while  the  centralized  law  may  either  perform  better  or  worse.  Thus 
an  analysis  is  necessary  in  each  case  to  determine  whether  the  performance 
penalty  for  implementing  a  distributed  detection  law  is  worth  the  payoff  in 
terms  of  reduced  communications  requirements. 

We  then  investigated  the  problem  of  distributed  detection  of  an  unknown 
signal  in  noise.  This  problem  was  found  to  be  considerably  more  difficult 
than  the  case  of  a  known  signal  since  the  detection  law  depends  upon  the  en¬ 
tire  received  signal  -  a  scalar  sufficient  statistic  does  not  exist  in  gener¬ 
al.  As  the  optimal  distributed  detection  law  could  not  be  determined  we 
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investigated  laws  employing  local  likelihood  ratio  tests.  Even  for  this  case 
results  are  difficult  to  obtain  since  the  local  log-likelihood  ratios  are  not 
Gaussian.  However,  for  an  important  special  case  (long  observation  time, 
ideal  bandlimited  signals,  white  sensor  noise)  we  were  able  to  obtain  analyt¬ 
ical  results.  For  this  case  we  found  that  two  identical  distributed  sensors 
generally  perform  as  well  as  1.25  to  1.40  centralized  sensors. 

More  general  problems  of  unknown  signals  in  noise  do  not  appear  solvable 
by  using  the  methods  of  this  report  since  the  exact  probability  distribution 
function  is  generally  difficult  to  determine.  The  approach  used  in  central¬ 
ized  problems  [1],(21  is  to  generate  approximations  based  on  Chernov  bounds 
and  Gaussian  approximations  or  Edgeworth  expansions.  We  considered  several 
such  approximations  for  the  distributed  problem  -  in  all  cases  the  approxima¬ 
tions  were  too  inaccurate  to  be  useful.  A  further  drawback  to  those  approxi¬ 
mations  is  that  the  moment  generating  function  pi(r,s)  is  required  and,  as 
Appendix  A  makes  clear,  these  quantities  are  difficult  to  compute  for  all  but 
the  ideal  bandlimited  case. 

Work  in  distributed  detection  is  in  its  Infancy,  and  many  of  the  issues 
that  arise  in  the  centralized  theory  can  also  be  formulated  in  the  distributed 
setting.  Moreover,  the  increasing  tendency  to  net  sensors  together  in  mili¬ 
tary  surveillance  systems  makes  the  distributed  detection  problem  formulation 
of  potential  practical  significance.  However,  as  the  results  of  this  report 
make  clear,  the  distributed  version  of  a  given  detection  problem  is  often  sig¬ 
nificantly  more  difficult  to  solve  than  its  centralized  counter  part. 
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APPENDIX  A 

DERIVATION  OF  LRT  PROBABILITY  DISTRIBUTIONS 


In  this  appendix  we  derive  the  joint  probability  distribution  of  the  log- 
likelihood  ratios  considered  in  Eq.  3-8.  Recall  that  the  log-likelihood 
ratios  £i  are  defined  as 

K 

Aj  =  lim  (A-I) 

K-*-® 


where 


K 

li  L 


1 

2 


K 

Z 

k=l 


I  2,k 

c  .  A 


l+c^A^ 


and  the  are  the  eigenvalues  of  K®(t,u)  where 


(A-2) 


KS(t,u)  =  E{s(t)s(u))  .  (A-3) 

k 

The  Yi  are  the  Karhunen-Loeve  coefficients  associated  with  the  k-th  eigenfunc¬ 
tion  of  K®.  It  is  straightforward  to  verify  that 
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E{yi |hJ } 


i.l,2,j-0,l 
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(A-5) 
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l+c2xk 
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(A-10) 


and  similarly 


U^(r ,s) 
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(  l+r(  l-p2)c2x*'^)  ( l+s(  l-p2)c2x*^)-p^ 

If  i  W  2  L 


(A-11) 


where 


Pl^  ^  ’  ( A  1 2 ) 

‘J(l+c2xk)(l+c2xk) 

K 

The  Pi  are  not  in  a  form  which  allows  inversion  and  thus  we  simplify  Eqs. 
A-10  and  A-11.  We  do  this  by  noting  that  [1],  if 

«e 

gx  A  Z  g(Xi)  (A-13) 

i-1 
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where  are  the  eigenvalues  associated  with  K(t,u),  then  for  long  enough  ob¬ 
servation  time  T 


CO 

gx  H  T  /  g(S(f))df 

—00 


(A-14) 


where  S(f)  is  the  spectrum  of  K(t,u).  The  can  be  written  as  in  Eq.  A-13  by 

using  Eq.  A-IA  if  we  first  take  the  logarithm  and  the  limit  as  K-h».  Thus 
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where 
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y(l+c2s(f))(l+c2s(f)) 


The  integrals  in  Eqs.  A-15  and  A-16  are  readily  evaluated  for  the  ideal 
bandliraited  spectrum,  yielding  (after  exponentiation) 


PqCt.s)  = 


l+c‘^/2W 

1 


l+(l+r)c2/2W 


l+c^/2W 


1+(1+s)c2/2W 


(A-18 


p  (r,s)  =  - — — — - - 

( l+r( l-p2)c2/2W)( l+s( l-p2)c2/2W)-p2 


(A-19 


where 


/c2/2W  c2/2W 

V  1  2 

^(1+c2/2W)(1+c2/2W) 


(A-20 


We  define  Ei=c2T  as  the  signal  energy  received  by  the  i-th  sensor,  A=2WT 
as  the  observation  time-signal  bandwidth  product,  and  Yi=c2T/2WT  as  the  sig- 
nal-to-noise  ratio  in  the  signal  bandwidth.  With  these  definitions  we  have 
obtained  Eqs.  3-15  and  3-16  of  Section  3: 
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UjjCr.s)  = 


_  A/2  p  A/2 

1+Yl  1+Y2 


l+(l+r)Y  1+(1+s)y 


(A-21) 


jj  (r,s)  =  - 

(l+r(l-p2)Y^)(l+s(l-p2)Y2)-p^ 


(A-22) 


Using  a  standard  set  of  Laplace  transforms  [5]  we  can  invert  Eq.  A-21  to 


obtain 


P(£^,£2|h0)  = 


i/2  i/2-1 

1+Y  U  A  J 


(r(A/2))‘ 


(A-23) 


Inverting  Eq.  A-22  on  r  first  yields 
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(continued) 
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s+1/(y2(1"P^)) 


A/ 2 


(l-p2)Y^Y2  s+l/((l-p2)Y2) 


(A-24) 


which,  using  the  time  shift  property  of  Laplace  transforms,  yields 

£l  £2 
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The  inverse  transform  is  standard  and  yields 


p(£^,£2|Hl)  = 
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Ia/2-1  2 


(l-p2)YjY2) 


where  Iv(*)  is  the  modified  Bessel  function  of  order  v. 

To  obtain  the  distribution  functions  we  now  integrate  Eqs.  A-23  and  A-26. 
To  perform  these  integrations  we  assume  that  A/2-1  is  a  nonnegative  integer. 

We  obtain  under  H®: 
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where 
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6i  =  — Ti  ,  i=l  ,2  • 
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To  integrate  „e  expand  Iy(»)  in  a  series  and  integrate  term 


by  term.  From  [13]  we  have  that 
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where 


Ti 


(l-p2)Yi 


(A-32) 


We  thus  have  derived  closed-form  expressions  for  the  joint  probability  distri¬ 
bution  functions  as  desired. 
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